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ABSTRACT 

A single-screen model of the gravitational lens system 2016+112 is proposed, that ex- 
plains recent Hubble Space Telescope (HST) infrared (NICMOS-F160W) observations 
and new high-resolution European VLBI Network (EVN) 5-GHz radio observations, 
presented in this paper. In particular, we find that a massive 'dark' structure at the 
lens position, previously suggested by X-ray, optical and spectroscopic observations of 
the field around 2016+112, is not necessarily required to accommodate the strong lens- 
ing constraints. A massive structure to the north-west of the lens system, suggested 
from a weak-lensing analysis of the field, is included in the model. The lensed source is 
an X-ray bright active galaxy at z=3.273 with a central bright optical continuum core 
and strong narrow emission lines, suggestive of a type-II quasar. The EVN 5-GHz 
radio maps show a radio-jet structure with at least two compact subcomponents. We 
propose that the diamond caustic crosses the counter-jet of the radio source, so that 
part of the counter-jet, host galaxy and narrow-line emission regions are quadruply 
imaged. The remainder of the radio source, including the core, is doubly imaged. Our 
lens model predicts a very high magnification (/i~300) at the bightness peaks of the 
inner two radio components of complex C. If the jet exhibits relativistic velocities 
on micro-arsecond scales, it might result in apparent hyperluminal motion. However, 
the lack of strong radio variability and the peaked radio spectrum imply that these 
motions need not be present in the source. Our model furthermore implies that the 
optical spectrum of C can only show features of the AGN and its host galaxy. 

Key words: Gravitational lcnsing - quasars: general - radio continuum: general 



1 INTRODUCTION 

The gravitational lens system 2016+112, discovered by 
Lawrence et al. (1984), has defied any simple explanation. 
The system consists of two AGN images (A and B) at a 
redshift of z— 3.273 (Lawrence et al. 1984; Schneider et al. 
1985, 1986), separated by 3.4 arcsec. Early optical and near- 
infrared observations (e.g. Schneider et al. 1985; Langston, 
Fischer & Aspin 1991; Lawrence, Neugebauer & Matthews 
1993) showed the presence of two extended objects (desig- 
nated ffl'^ and D). C' lies sume 2 aicsec suuLh-easL uf im- 
age B, such that ABC' nearly form a right triangle. Objcet 
D, the primary lens galaxy, has a redshift of 1.01 (Schnei- 
der et al. 1985) and is close to the centroid of A, B and 

* The prime distinguishes it from the radio structure (desig- 
nated C), which early on was not known or thought to be as- 
sociated with the optical emission. 



C'. The redshift of C' has resisted measurement. Narrow 
Ly-a emission near A, B and C' has been detected at a 
redshift of 2=3.273 with line widths ^1000 kms" 1 (Schnei- 
der et al. 1986, 1987; Lawrence 1996; Yamada et al. 2001; 
Lawrence et al. 2002). Similarly, two fuzzy patches of Ly- 
cx emission were detected about 3 arcsec north-west and 
west of images A and B, respectively (Schneider et al. 1986, 
1987). Recently, high-resolution F814 W-band (Lawrence 
et al. 2001) and F160W-band observations we re obtained 
with the HST (see the CASTLES webp age frttp://cfa- 
www.harvard.edu/glensdata/MG2016.htni] ; e.g. Munoz et 
al. 1998), showing that images A and B are unresolved and 
compact, whereas C' is arc-like and does not show obvious 
compact structure. 

At radio wavelengths, 2016+112 has been observed with 
the VLA (e.g. Lawrence et al. 1984; Schneider et al. 1985) 
and displays three components, A, B and C. All components 
contain compact substructure in higher resolution MER- 



© 2001 RAS 



2 Koopmans et al. 



LIN (Garrett et al. 1994), VLBI (Heflin et al. 1991) and 
EVN (Garrett et al. 1996) images. Whereas components A 
and B consist of at least two subcomponents (oriented ap- 
proximately north-west), radio component C splits into four 
dominant subcomponents aligned east-west (e.g. Garrett et 
al. 1996). Components A and B have steep integrated ra- 
dio spectra between 18 cm and 6 cm (Lawrence et al. 1994), 
whereas component C has a somewhat flatter integrated ra- 
dio spectrum. All component spectra steepen between 6 and 
2 cm (Langston et al. 1991). The overall integrated spec- 
trum of the lens system is that of a GPS source and peaks 
somewhere in the range of 1 to 5 GHz. With the VLA in 
A-array, the source is barely detectable at 22 GHz (Patnaik, 
private communication). Observations with EVN at 18cm 
and MERLIN at 6 cm (Garrett et al. 1996) show that the 
outermost images (Cn and C2) have significantly steeper 
spectra than the two innermost components (C12 and C13). 
These observations suggest that the structures C11+C12 and 
C13+C2 have opposite parities and are probably images of 
the same structure in the source. 

Observations of the field around 2016+112 with the 
ASCA satellite X-ray Observatory suggested the presence of 
diffuse X-ray emission centered on the lens system and to the 
north-west (Hattori et al. 1997; Benitez et al. 1999). Recent 
observations with the Chandra X-ray Observatory, however, 
unambiguously show that this emission is mostly due to 
discrete sources and not diffuse cluster emission (Chartas 
et al. 2001). In addition, the lensed images A and B, and 
complex C', are detected. The presence of X-ray and nar- 
row emission lines suggests that the source could in fact be 
a type-II quasar (e.g. Yamada et al. 2001; Chartas et al. 
2001). None of the other X-ray sources in the field are as- 
sociated with galaxies detected by Soucail et al. (2001; see 
below). The comparatively large number of X-ray sources in 
the field of 2016+112 might be the result of a magnification 
bias, which enhances the number count of sources around 
the lensing mass distribution if the number density of X-ray 
source increases steeply at redshifts larger than that of the 
lens galaxy. 

Spectroscopic observations by Soucail et al. (2001) show 
the presence of an overdensity of galaxies at the same red- 
shift as lens galaxy D. Similarly, Clowe et al. (2001) detect a 
3-<r weak-lensing signal to the north-west of the lens system, 
roughly coincident with the region of excess X-ray emission 
(e.g. Benitez et al. 1999) . The latter could again be due to a 
magnification bias. Only a marginal weak-lensing signal at 
the position of the lens system was found, consistent with 
the absence of a very massive dark X-ray cluster. If the dis- 
persion in velocities of the field galaxies, found by Soucail 
et al. (2001), are representative of a virialized system, one 
would have expected to see diffuse X-ray emission (Chartas 
et al. 2001). Absence of the latter therefore also suport that 
these galaxies have not yet virialized and formed a massive 
centrally concentrated cluster. 

Several models have been proposed to explain these 
observations. Some employ a single deflector and a single- 
screen (Langston et al. 1991; Benitez et al. 1999), whereas 
others used more complex models with C' and D being dif- 
ferent galaxies (Narashima et al. 1987). In the proposed two- 
screen models it is assumed that object C' is a galaxy at a 
redshift different from galaxy D, and responsible for lensing 
radio complex C into additional 'subimages' (Nair 1993; Nair 



& Garrett 1997). The mirror-symmetry and opposite pari- 
ties for the structures C11+C12 and C13+C2 suggest that 
the source structure corresponding to complex C is quadru- 
ply imaged and that there is no need to invoke a second lens 
screen, that is not a weakly perturbative, but changes the 
nesting of the caustic curves in the sources plane. However, 
a two-screen model or a model that has two lens galaxies in 
the same lens plane, predicting the same parities as a single- 
screen single-lens model (Nair & Garrett 1997) can of course 
not be excluded based on this argument alone. 

In this paper we propose an alternative model that 
can quantitatively explain these observations with a single 
screen. Instead of explaining the lens system with a com- 
plex deflector model, we find it can also be explained by a 
realistic, although more complex source model. In Sect. 2, we 
present EVN 5-GHz radio observations of 2016+112, which 
suggest that complex C consists of two images with opposite 
parities. In Sect. 3, a detailed structure of the source is pro- 
posed which qualitatively can explain most of the observed 
features of the 2016+112 lens system. In Sect. 4, we present 
a model of the lens potential that incorporates mass struc- 
tures from the field around the lens system. In Sect. 5, we 
compare the model to the observational constraints to find 
that it can also quantitatively explain the available observa- 
tional constraints of the lens system and the field. In Sect. 6, 
our results are summarized and discussed. Throughout this 
paper we assume a flat smooth Friedman- Robertson- Walker 
(FRW) universe with fi m =0.3 and H =65 kms -1 Mpc~\ 



2 EVN 5-GHZ OBSERVATIONS AND DATA 
ANALYSIS 

The 5-GHz EVN observations were made on 17-18 May 
1995 from UT 22h30 to 10h30, using 7 antennas of the 
EVN: the 100-m Effelsberg (DE), 26-m Jodrell-Mk2 (UK), 
25-m Onsala (SE), 32-m Medicina (IT), 32-m Noto (IT), 32- 
m Cambridge (UK), and Westerbork-array (NL). Technical 
problems resulted in no data from the latter two antennas. 
The recording mode was Mkllla Mode A (56 MHz band- 
width, Left Hand Circular polarisation). A phase-reference 
observing scheme was used, switching between the target, 
2016+112, and a compact calibrator, J2029+121, located 
1 degree away. The tape ran continuously through each 13 
minute pass, and the source switching cycle consisted of al- 
ternating between 90 seconds on the calibrator and 140 sec- 
onds on the target. Correlation of the data was conducted 
at the MPIfR Mkllla correlator in Bonn. To prevent loss 
of data during correlator synchronisation of the very short 
scans, each baseline was correlated twice as a continuous 
tape pass, once at each of the two source positions. (Spuri- 
ous correlations on the "wrong" source were edited out in 
the later data analysis) . 

Subsequent data processing and analysis was performed 
with the NRAO AIPS package. Long fringe-fitting solu- 
tions were made that included several switching phases of 
the calibrator; the resulting phase, delay and rate solutions 
were then applied to 2016+112. Although the interferome- 
ter model used in the MPIfR correlator was not sufficiently 
accurate to permit direct phase-referencing, this process did 
remove short-term phase fluctuations, and allowed longer so- 
lution intervals to be used for self-calibration of the phases of 
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Figure 1. European VLBI Network (EVN) 5-GHz radio images of regions A, B and C of 2016+112. Contour levels indicate 
(—3, 3, 5, 10, 20) times the rms noise level of 0.14 mjy beam . One notices the jet-like substructure in images A and B, and the apparent 
'mirror-symmetry' of components C12 and C13, suggesting opposite parties due to lensing. The model proposed in this paper assumes 
that both components arc images of a quadruply-lensed part of the source, which is associated with the counter-jet in components A2 and 
B2, the core of the source. Components C11 and C2 are presumably steep-spectrum images of intrinsically faint (but highly magnified) 
substructure further along the counter-jet (i.e. west of A2 and B2). The restoring beam has a FWHM of 6 milli-arcsec. 



the 2016+112 visibilities. Amplitude calibration was made 
by initially assuming that J2029+121 is a 0.91-Jy point 
source on all baselines, and then determining self-calibration 
corrections after mapping the source. These amplitude and 
phase corrections were also applied to 2016+112. In order to 



prevent fringe-rate and delay smearing over the 4 arcsecond 
field of view, the data associated with 2016+112 were main- 
tained as 28 contiguous 2 MHz channels with a visibility av- 
eraging time of 2.5 seconds. The calibrated 2016+112 data 
were Fourier transformed and a naturally weighted, tapered 
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image of the full field was produced. All three main regions 
of emission A, B and C were clearly detected and the image 
was used as an input model for subsequent (phase-only) self- 
calibration. Since 2016+112 is a rather faint, resolved radio 
source, improvements to the original phase solutions were 
only obtained by employing a relatively long solution inter- 
val (13 minutes) over the entire frequency band (56 MHz). 
CLEANed maps of the three main regions of emission are 
presented in Fig.l. All the maps are naturally weighted (the 
rms noise level is ~140 fiJy beam - ) and the FWHM of the 
circular restoring beam is 6 mas. 

To obtain the positions and flux-densities of the radio 
components in regions A, B and C, elliptical Gaussians were 
fitted to the images. Images A and B were fitted by two 
Gaussians. Images C12 and C13 could not be fitted by single 
elliptical Gaussians and were therefore each fitted by two 
Gaussians (a and b, respectively). The results of these fits 
are listed in Table 1. 



3 THE SOURCE 

Based on the optical/IR and radio observations of 2016+112 
(see Sections 1 and 2), we postulate the following simple 
picture for the lensed source (see also Langston et al. 1991 
and Benitez et al. 1999 for analogous models). First, we 
assume that optical continuum emission of images A and B 
is from an AGN (possibly a type-II quasar) near the core of 
some faint underlying host galaxy, which is not detected at A 
and B due to its very low surface brightness compared to the 
AGN. Second, the NW elongation of the EVN 1.7 GHz radio 
images of A and B (Garrett et al. 1996), is now confirmed 
by the high-resolution EVN observations at 5 GHz presented 
in Sect. 2, showing that both components consist of at least 
two subcomponents. We assume that the flatter-spectrum 
subcomponents seen at 5 GHz (A2 and B2) are associated 
with the optical continuum core and the other two (Ai and 
Bi) are jet-features. Because the constraints from the optical 
emission are not used at this level of positional accuracy (few 
mas), this particular choice is of little relevance for the lens 
modeling. 

Additionally, strong spectral lines (e.g. Ly— a, C-IV, N- 
V, Si-IV) are seen near images A, B and C' (e.g. Schneider 
et al. 1986, 1987; Lawrence 1996; Yamada et al. 2001). How- 
ever, the line widths are very narrow compared to a typical 
quasar spectrum drawn from the same redshift range (see 
Steidel et al. 2001). This is a puzzle, although it could be 
that line emission from the BLR is obscured, as suggested 
by Yamada et al. (2001), and that only the NLR further 
from the core is seen. This would also explain why the Ly-a 
emission is not exactly coincident with A, B and C', but 
offset from C' by 1 WFPC2 pixel (0.0455") when aligned 
with A and B. These narrow-line spectra are indicative of a 
type-II quasar (Yamada et al. 2001), which is supported by 
the X-ray brightness of the source (Chartas et al. 2001). A 
schematic picture of the source is shown in Fig. 2. 

Qualitatively, this source structure can explain both the 
optical and radio data of 2016+112 available to date, if the 
diamond caustic (fold) crosses the source as indicated in 
Fig. 2: (i) the optical core is doubly imaged at A and B, be- 
cause it falls outside the diamond caustic, (ii) most of the 
associated radio structure is also doubly imaged near A and 
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Figure 2. A schematic view of the proposed source structure for 
2016+112. The AGN, associated radio core, part of host galaxy 
and the second radio component are doubly imaged. The western- 
most part of the radio core and part of the host galaxy are quadru- 
ply imaged. The components are not drawn to scale, nor do the 
cllipticity and position angles of host galaxy and other structures 
necessarily represent those of the true underlying source. 



B, (iii) only part of the radio structure, associated with the 
counter-jet near radio subcomponents A2 and B2, is quadru- 
ply imaged, such that C12 and C13 are fainter than what one 
might expect based on their proximity to the critical curve 
(i.e. the high magnification) and the flux-density of A2 and 
B2, (iv) components C11 and C2 are steep-spectrum emission 
further along the counter-jet, which are quadruply imaged 
but only detected near C because of the very high magni- 
fication near the critical curve, (v) part of the underlying 
host galaxy is doubly imaged, but is too faint to be seen at 
images A and B as a result of the bright AGN emission (and 
its associated PSF), (vi) the other part of the host galaxy 
is quadruply imaged, resulting in the highly magnified arc 
(i.e. complex C'), (vii) similarly the extended Ly-a emission 
(Lawrence et al. 2002), as well as the narrow-line emission 
in general (e.g. Yamada et al. 2001) is expected to surround 
the optical core and is part doubly and part quadruply im- 
aged (i.e. that part inside the diamond caustic) and therefore 
seen near A, B and C'. This is supported by the idea that 
the emission-line ratios near C' are more consistent with it 
originating further from the core (Yamada et al. 2001). Al- 
though the proposed model might appear very complicated, 
it is precisely the complex lensed structure one expects from 
AGNs - which are wealthy in wavelength-dependent struc- 
ture on sub-arcsec scales - if they happen to cross a caustic. 

Support for the suggestion that part of the radio source 
crosses the caustic is given by the actual 'merger' of radio 
components C12 and C13 at the few-a level of the surface- 
brightness contours in Fig.l. This only occurs if part of the 
source at that surface-brightness level actually crosses the 
caustic in the source plane. The critical curve crosses pre- 
cisely that point in the image plane where the two images 
merge (i.e. the 'saddle-point' in the surface brightness distri- 
bution between C12 and C13; see e.g. Kochanek, Keeton & 
McCleod 2000). Due to the conservation of surface bright- 
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Table 1. Properties of the image components determined from the 5-GHz EVN data (Sect. 2). Column (1): The component's name as 
in Garrett et al. (1996). Columns (2-3): The positions determined from Gaussian fits to the 5-GHz EVN images. The positional error 
is 1 mas with respect to Ai. Column (4): The total flux-density and error on the last significant digit (within parentheses) of each 
of the components. Column (5): The position angles (PA) of the presumed jet direction, measured in the direction Ai— >A 2 , Bi— >B 2 , 
Ci 2 f,-+Ci 2a ^Cii and Ci3(,— »Ci3 — »C 2 from north to east. Column (6): The parities p±, based on general assumption of the source 
structure and geometry of the time-delay surface (e.g. Blandford & Narayan 1986). 



ness in gravitational lensing, we expect the point where im- 
ages Ci 2 and C13 merge to be accociated with extended 
structure north-west of A 2 and B 2 at the few-cr contour level 
as well (see Fig.l). Images Ci 2 and C13 are associated with 
very compact (i.e. a few /ias) substructure in the counter- 
jet, which unfortunately is barely observable near images A 
and B, due to their relatively low magnifications (see Sect. 5 
below) and of course the finite resolution of the observa- 
tions. This is even more strongly the case for the faint and 
steep-spectrum emission from Cn and C 2 . 

In the model by Langston et al. (1991) the second flat- 
spectrum source component (i.e. the component associated 
with A 2 and B 2 ) lies almost fully inside the diamond caustic, 
whereas the steeper spectrum component straddles the caus- 
tic. Nair & Garrett (1997) showed, however, that this partic- 
ular model predicts the inner two images of complex C to be 
steep spectrum and the outer two flat spectrum, opposite to 
observations. On this basis, the model from Langston et al. 
(1991) was rejected. In the model proposed here, however, 
the steeper spectrum component lies fully outside the dia- 
mond caustic, whereas only a small part of the flat-spectrum 
component straddles the caustic from the outside. This small 
part (i.e. the counter-jet) inside the caustic is quadruply im- 
aged and highly magnified, creating images Ci 2 and C13. 
Note that the direction of merging images Ci 2 and C13 is 
perpendicular to the caustic near the source position (Fig. 3 
below), as must generally be true for a single screen model 
(Blandford & Narayan 1986). Further inside the caustic, we 
postulate that the spectrum of the counter-jet steepens and 
that a small subcomponent is lensed in to images Cn and 
C 2 . Both of these are only seen near C, because of their 
exceedingly high magnifications (Sect. 5). 

This type of lens-source configuration is not uncom- 
mon. Several radio gravitational lens systems have been ob- 
served with part of the source inside and part of the source 
outside the diamond caustic (e.g. Einstein rings). In par- 
ticular, the JVAS/CLASS lens B1938+666 has a radio jet 
structure crossing the diamond caustic near the cusp, al- 
though in this case most of the source lies inside the caustic 
(King et al. 1997), wheres in 2016+112 only the counter-jet 
is quadruply imaged. 

The question is now whether we can also quantitatively 



explain the primary constraints (e.g. radio and optical image 
positions, the jet position angles (PA) and the flux-density 
ratios), and whether the resulting model is in agreement 
with secondary constraints, such as the properties of the 
host galaxy (D) and the detection of one or more nearby 
mass concentrations in weak-lensing (Clowe, Trentham & 
Tonry 2001) and spectroscopic studies (Soucail et al. 2001). 



4 THE LENS & FIELD 

First, we associate a singular isothermal elliptical (SIE) mass 
distribution (e.g. Kassiola & Kovner 1993; Kormann, Schnei- 
der & Bartelmann 1994; Keeton, Kochanek & Seljak 1997) 
with the primary lens galaxy D, although a mass model 
with a steeper or more shallow mass profile is not a priori 
excluded (e.g. Benitez et al. 1999). In light of the present 
uncertainties, especially about external perturbers, we feel 
it is not yet warranted to explore more detailed lens mass 
models. We associate the center of the mass distribution 
(MD) with the centroid of the surface brightness distribu- 
tion (SBD) of galaxy D, measured from the Hubble Space 
Telescope (HST) NICMOS Fi60W-band image (CASTLES; 
e.g. Munoz et al. 1998), i.e. (-1.740+0.003, -1.782+0.003) 
arcsec with respect to image A. In general, good agreement 
is found between position angles of the SBD and MD of 
lens galaxies (Keeton, Kochanek & Seljak 1997). To avoid 
underconstraining the mass model, we have chosen to fix 
the position angle of the MD at the observed SB value of 
—59+2 degrees (see CASTLES web page). The only free pa- 
rameters of the MD of lens galaxy D are therefore the axial 
ratio (/) and the central velocity dispersion (<T| | ; as defined 
in Kormann et al. 1994). 

To model the surrounding field, we place a singular 
isothermal sphere (SIS) mass distribution (designated Mi) 
at the position ( — 24.0, +60.0) arcsec NW of image A, where 
Clowe et al. (2001; see Sect.l) find a significant weak-lensing 
signal. A SIS fit to the weak-lensing shear field suggests a 
velocity dispersion of 9701i|q kms" 1 (1-cr errors; assuming 
a redshift of unity). We constrain the velocity dispersion of 
Mi to this value. In addition, Soucail et al. (2001) have spec- 
troscopically confirmed the presence of an overdensity of at 
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Table 2. The fixed (between brackets) and reconstructed mass-model parameters for the primary lens galaxy D and the two SIS mass 
distributions Mi and M 2 . The arrows indicate how the values change as the velocity dispersion of M 2 is lowered from 560 to 175 kms -1 . 
In addition to the mass distributions, an external shear (7ext) is found with strength 0.12— >0. 07 and position angle — 24°— >— 51°. 



least six in red galaxies at the redshift of lens galaxy D. They 
estimate a velocity dispersion of 77lljg[] kms -1 . Clowe et 
al. (2001) do not to find a strong shear signal at this posi- 
tion (a=560tfgo kms -1 ; 1-cr errors). The low SNR of the 
observations, however, does not allow one to exclude it. We 
include this mass distribution to first order by modeling it 
as a SIS (M 2 ). This is done in order to test whether an ad- 
ditional mass distribution is indeed required by the strong- 
lensing constraints, as suggested by these previous authors. 
We allow the velocity dispersion of M 2 to vary between 560- 
175 kms -1 and let its position be free. The upper limit is 
roughly defined, such that M 2 does not result in additional 
observable images of the background source, although this 
constraint could be lifted if this mass distribution has a large 
finite core and is not capable of multiple imaging. Similarly, 
the lower limit avoids additional sub-images of A, when M 2 
nearly coincides with its position (see Sect. 5). In the latter 
case, the velocity dispersion of M 2 is of galactic scale (few 
hundred kms -1 ), in which case the core radius is typically 
small. We include an external shear with both its strength 
and positional angle as free parameters. 

4.1 Constraints 

As primary constraints on this starting model, we use the 
observed properties of images A, B, Ci 2 and C13, as found 
from the HST NICMOS FWOW-banA (e.g. CASTLES) and 
EVN 5-GHz images (Sect. 2; Table 1). The relative positions 
of A and B in the optical and radio agree to within the mea- 
surement errors. There appears to be a slight offset in decli- 
nation between the Fl 60 W-band emission of complex C' in 
the HST image and the positions of the EVN radio compo- 
nents. It amounts, however, to only 1 pixel (0.1") and in light 
of the uncertain structure of the host galaxy, we consider 
this difference unimportant at present. The optical bright- 
ness ratio between images A and B, found from the HST 
NICMOS F160W-band image is r n =0.97±0.02 (i.e. S B /S A )- 
Garrett et al. (1996), however, find r n =0.84±0.01 at 1.7 
GHz. We therefore adopt an "average" value r n =0.90±0.10 
for the flux-ratio between images A and B, in our lens model- 
ing, where the uncertainty accounts for possible variability, 
even though the source does not appear to vary strongly 
(Haarsma et al. 2001). The adopted value is also consistent 
with r n =0.92±0.05 found from the EVN 6-cm observations, 
presented in this paper (Table 1). 

We do not use the positions of images Cn and C 2 as 
constraints. According to the source model (Sect. 3; Fig. 2), 
they are part of the source structure associated with lensed 
images A 2 and B 2 . They appear somewhat more resolved in 
the EVN 5-GHz observations (Sect. 2), although the signal- 
to-noise is low, and have steeper spectra compared with the 



other images between the EVN 1.7 and 5-GHz. Hence, they 
are probably not part of the same region of the source that 
is responsible for images A2, B 2 , Ci 2a /b and Ci 3a / b seen 
in Fig.l. The emission from Cn and C 2 most likely origi- 
nates further along the counter-jet, which presumably has 
a steeper spectrum (Sect. 3). We assume that the emission 
from Ci 2a and Ci3 a is associated with structure of the unre- 
solved images A 2 and B 2 , within a 1-mas radius from their 
respective centroids. Although this particular choice might 
seem arbitrary, it is probably conservative based on the no- 
tion that from an inverse Compton limit on the brightness 
temperature of these radio sources, between 10 11 and 10 12 K, 
a component size for A 2 and B 2 as small as 0.1-0.3 mas can 
be expected, given their observed flux-densities. If the axial 
ratios of the components are large (i.e. jet-like), the emission 
region could easily 'stretch' to 1 mas or larger in one direc- 
tion. We do not use the flux-density ratios between images 
A 2 , B 2 and complex C, because the source structure lies on 
a caustic and consequently has a strong magnification gradi- 
ent over its extent. The source can therefore not be treated 
as a point source, rendering the use of a flux-density ratio 
very difficult. All constraints are listed in Table 1. 

5 RESULTS 

Using the mass model and constraints discussed above, we 
vary the nine free parameters using the simulated annealing 
downhill simplex method described in Press et al. (1992), 
until the difference between the observed and recovered im- 
age properties are minimized in terms of the x 2_va l ue ( e -g- 
goodncss-of-fit). Using different starting values of the pa- 
rameters, we ensure that the final solution is close to the 
absolute minimum in the x 2_ space. 

We vary the velocity dispersion of M 2 between 560 and 
175 kms -1 (see above) and minimize \ 2 f° r the other free 
parameters. The goodness-of-fit \ 2 (f° r t wo degrees of free- 
dom) of the best models increases only marginally between 
these upper and lower limits on the velocity dispersion of 
M 2 , i.e. from 1.1 to 1.5 respectively. This indicates a strong 
degeneracy in the lens model, between the velocity disper- 
sion and position of M 2 and the external shear. The velocity 
dispersion and axial ratio of Gl are only marginally affected. 
We note that the x 2_ values can not be used to calculate a 
likelihood for the model, because we did not strictly use 
measurement errors for all constraints. The image positions, 
flux-ratio, PA's and parities are recovered in good agreement 
with the constraints (Table 1), given the uncertainties in the 
structure of the lensed images. The recovered image prop- 
erties, inferred magnifications and time-delays are listed in 
Table 3. In Fig.3, we show the image configuration, critical 
curve, caustics and time-delay surface of this model. 
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Arcsec. 

Figure 3. The model of 2016+112 as indicated in Table 2, assuming M2 is an L* galaxy with o-||=225 krns" 1 at (-3.2", -0.4") from 
image A. The thick line indicates the critical curve. The ellipse indicates the position, flattening and position angle of the primary lens 
galaxy (i.e. D), whereas the dashed lines indicate constant time-delay contours. The subpanel (units in arcsec) shows the caustics of 
galaxies D and M2, the source (indicated by the star inside the caustics) and the two Ly-o patches (stars with circles; see Sect. 6). The 
caustics are displaced to the north from the primary lens galaxy by ~10 arcsec due to a massive structure Mi, inferred from weak lensing 
(see text). 



In Table 2, we have listed the recovered parameters of 
the lens galaxy D and two additional mass structures Mi 
and M 2 . We find that the SD axial ratio /=0. 75-0. 76 of 
galaxy D is somewhat larger than that determined from 
its SB distribution, i.e. 0.57 ± 0.01, as seen in the HST 
NICMOS-Fi 60 W observations, (e.g. CASTLES web page; 
see also Benitez et al. 1999) . The velocity dispersion of 320- 
340 kins" 1 of galaxy D implies a rest-frame mass-to- light 
ratio of (M/L)h~1-7-/i65 Mq/Lh,© (no evolutionary correc- 
tion), assuming a singular isothermal sphere to calculate the 
mass inside the Einstein radius, an FJ60W-band magnitude 
for lens galaxy D of 18.12±0.04 (see CASTLES web-page) 



and the galaxy models from Poggianti (1997). This mass- 
to-light ratio compares well with those found for other lens 
galaxies (e.g. Jackson et al. 1998), which indicates that the 
image separation should not be significantly affected by a 
mass contribution with a different mass-to-light ratio (e.g. 
a dark cluster). In that case, the lens galaxy would most 
likely not have been on the Fundamental Plane of early-type 
galaxies either (Kochanek et al. 2000). 

The model also indicates the presence of an external 
shear with a strength 7 between 0.07 and 0.12 and a po- 
sition angle between —24 and —51 degrees, depending on 
the velocity dispersion of M2. If we investigate the I-band 



© 2001 RAS, MNRAS 000, 000-000 



8 Koopmans et al. 



image (Fig. 2 in Soucail et al. 2001) in more detail, there ap- 
pears to be a 'filamentary structure' of high-redshift galaxies 
running across the lens system. This 'filament' is not dy- 
namically related, because it contains galaxies over a very 
different range of redshifts [z ?a0. 6-1.1), but it might be re- 
sponsible for the shear at the position of 2016+112. On the 
other hand, the closeness of the position angle of the exter- 
nal shear and that of the SB of galaxy D, for low velocity 
dispersions of M2, might also indicate that either the mass 
distribution of galaxy D has a different radial mass profile 
or that its flattening is a function of radius. 

Finally, we note the remarkably high magnification 
(/x^300) at the brightness peaks (i.e. not integrated over 
the image) of images C12 and C13, although the precise 
value is sensitive to the details of the model input (see also 
below). For example, the magnification changes by about 
~10% when changing the slope of the radial mass profile by 
~5%. Even so, this is the highest (inferred) magnification 
for any known lens system. Because it is primarily directed 
tangentially and the jet direction is nearly perpendicular 
to the fold caustic, according to the model, any relativis- 
tic motion in the radio jet structure will be enhanced by a 
factor Superluminal velocities of say ^h' 1 c (e.g. Ver- 
meulen & Cohen 1994) could therefore lead to hyperluminal 
velocities of ~10 3 /2+ 1 c. Similarly, one can probe structure 
on intrinsic scales of a few micro-arcsecond when observed 
with VLBI. On the other hand, if the lensed structure is 
part of the counter-jet (see Sect. 3), such high velocities are 
no longer expected (even though velocities of ~ 0.5 h^ 1 c for 
the counter-jet are still likely and could lead to hyperluminal 
velocities of ~ 10 2 hT x c). The absence of strong variability 
in the source (Haarsma et al. 2001) and its GPS-type ra- 
dio spectrum might, however, be indicative of the absence 
of strongly relativistic motion. 

One problem that we have not yet addressed is the 
considerable difference in angular distance between the pair 
components C11-C12 and C13-C2, which we did not include 
in the lens model. Given the high magnification and sym- 
metry around the critical curve, one would expect these dis- 
tances to be similar. The fact that they are not seems to ar- 
gue against our model and in favor of a two lens model (Nair 
& Garrett 1997). However, magnifications and the magnifi- 
cation matrix are a function of differences in higher-order 
derivatives of the local lens potential (see Schneider et al. 
1992). Especially in regions of very high magnification, many 
of these derivatives are very close to zero, because in those 
cases the lensed images form at very shallow extrema of the 
time-delay surface (e.g. Blandford & Hogg 1996). It is there- 
fore not inconceivable that even a miniscule perturbation 
of the local lens potential (e.g. by a globular cluster, halo 
substructure, etc) will have an enormous effect on the local 
magnification matrix, i.e. on the image magnifications and 
the image positions. These discrepancies near critical curves 
have been seen in other lens systems with very high image 
magnifications (Mao & Schneider 1998) and might not be 
uncommon in general. As an example, if we include a typ- 
ical globular cluster with a velocity dispersion cr=7kms _1 
(Einstein radius of <yl mas) about 5 mas away from image 
C2, it changes the magnifications of C2 and Cn consider- 
ably, but also projects them on the same position in the 
source plane. The magnification ratio between C12 and C13 
remains equal to unity within a few percent. The probabil- 



ity of such minor perturbations is considerable, especially 
because one expects hundreds if not thousands of globular 
clusters, dwarf satellites, etc. around these massive elliptical 
galaxies. At higher redshift, according to cosmological CDM 
models, the amount of halo substructure could be even more 
prevalent. We are therefore not too worried about this ap- 
parent discrepancy, but are warned that even though the 
magnifications in complex C are very high, their precise val- 
ues are quite uncertain and should only be taken as indica- 
tive. 



6 DISCUSSION & CONCLUSION 

We have shown that 2016+112 can be explained by a single- 
screen mass model. The observational constraints are rea- 
sonably well reproduced by the proposed model. The ax- 
ial ratio of the surface MD of the lens galaxy D is some- 
what larger than the axial ratio of its SB distribution, as 
determined from HST NICMOS F160W-band observations. 
Also its mass-to-light ratio is in good agreement with that 
of other lens galaxies. Surprisingly, our model is consistent 
with the presence of a massive component about 1' north- 
west of the lens galaxy, suggested from a weak-lensing anal- 
ysis of the field (Clowe et al. 2001). We find evidence for an 
external shear that might result from a 'filamentary struc- 
ture' of high-redshift galaxies running across the lens sys- 
tem roughly from east to west (see Soucail et al. 2001). This 
filament is not a dynamically related structure, because it 
contains galaxies over a wide range of redshifts. The mas- 
sive dark cluster previously suggested can not be confirmed 
or excluded, based on the strong-lensing constraints. In par- 
ticular, a single ~ L* galaxy several arcsec east of galaxy 
D (see Fig. 3) could also be consistent with the strong lens- 
ing constraints. A faint object at that approximate position 
does appear on a deep R-band image, i.e. Fig. 4 in Clowe 
et al. (2001), although we do not know whether that object 
is indeed a galaxy and what redshift is has (i.e. whether it 
forms a second lens screen). Given the high redshift of the 
source, a perturbing galaxy at that position could have a 
wide range of redshifts (0.5-1.5), not necessarily being that 
of the primary lens galaxy. 

Because we can explain the available data on 2016+112 
with a single-screen model, we conclude that a second screen 
is at most only a pertrubation and that the optical/IR ob- 
ject C' is a highly magnified arc of the AGN host galaxy 
at z=3.273. Thus, we predict that optical spectroscopy of 
complex C' will not yield a redshift in between that of lens 
galaxy D and the source as previously suspected, but will 
instead only show features at the source redshift. 

Our model predicts a very high magnification (/i~300) 
at the brightness peaks of images C12 and C13. This can 
be expected based on the proximity of these images (only 
~20mas; see Fig.l) to the critical curve passing in between 
them. Because, the magnification is inversely proportional to 
the separation between the two images (e.g. Schneider et al. 
1992, Chapt.6), the small separation (~40mas) between the 
images results in an order of magnitude higher magnification 
than normally observed. This is enhanced by the small angle 
between the line joining the images and the caustic, resulting 
in an even smaller distance of the images to the caustic (i.e. 
a higher magnification). A magnification that is 1-2 orders 
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Comp. 


R,A (a,rcssc) 


Dec (a,rcs6c) 


r {B,C}/A 




Af/Ji.. fringe '\ 


Ai 


+0.0008/0.0008 


-0.0002/0.0000 


[1.0] 


+4.0/3.3 


=0.0 


A 2 


-0.0121/0.0121 


+0.0040/0.0040 


[1.0] 


+4.0/3.3 


3.3/9.1 


Bi 


-3.0060/3.0060 


-1.5036/1.5033 


0.91/0.85 


-3.6/2.8 


255.3/291.6 


B 2 


-3.0126/3.0127 


-1.4979/1.4981 


0.90/0.84 


-3.6/2.8 


254.9/286.4 


Cl26 


-2.0542/2.0542 


-3.2294/3.2292 


91.8/77.4 


-367/252 


224.1/254.6 


Cl36 


-2.0855/2.0855 


-3.2254/3.2257 


92.5/78.2 


+369/255 


224.1/254.6 



Table 3. Recovered image parameters (see Table 1). The flux ratios and magnifications (assuming point-source structures) are given 
by r{ B ,c}/A an d M> respectively, and the time-delays arc given by At with respect to the leading image Ai (assuming the cosmological 
model mentioned in Sect.l). The first and second values indicate those for <tmi=560 and 175 kms -1 , respectively. The recovered source 
positions for {Ai, Bi} and {A 2 , B 2 , C 12b , C 13b } are (1.214/5.010", 10.143/9.670") and (1.220/5.003", 10.144/9.669"), respectively. 
Even though, the change in source position is large, when changing <tmi , this is no relevance because it not an observable. 



of magnitude larger than normally seen for two merging im- 
ages (/i~10), can therefore be expected. This magnification 
could lead to observable hyperluminal motion with veloci- 
ties of order ~10 3 h' 1 c of micro-arsec-scale structure in the 
lensed jet, although the GPS-type radio spectrum and low 
variability of the source at present do not support the notion 
of high apparent velocities in the radio jet, especially also 
because our model suggests that it is the counter-jet that is 
being magnified. 

Our source model appears to be that of a type-II quasar 
(e.g. Yamada et al. 2001; Chartas et al. 2001). At radio 
wavelengths, only the counter-jet of the source is quadru- 
ply imaged, whereas the AGN core and radio jet are dou- 
bly imaged. The host galaxy and the extended narrow-line 
emission around the core is part doubly and part quadruply 
imaged, which explains its offset from the optical and ra- 
dio structures and also the difference in line ratios between 
images A & B and complex C (Yamada et al. 2001). The 
absence of BLR emission and the presence of X-ray emis- 
sion (Chartas et al. 2001) further support the identification 
of this source as a type-11 quasar. 

Finally, we suggest that the two patches of Ly-a emis- 
sion found by Schneider et al. (1986, 1987) could be cold gas 
clouds in the 1GM - possibly around some nearby galaxies - 
that are illuminated by the emission cone coming from the 
AGN along the jet axes. There are indeed several objects 
at the positions of these patches (Soucail et al. 2001; Clowe 
et al. 2001), of which the brightest has the same redshift as 
the source (Soucail et al. 2001). When projecting the two 
patches on the source plane (Fig.3), using the mass model 
presented in this paper, we find that one lies close to the 
(counter) jet axis in the source plane, i.e. ~20°, whereas the 
second patch makes an angle of ~50°. Their distances to 
the AGN in the source plane are 1.9 and 1.5 arcsec, respec- 
tively. They are therefore not multiply imaged. If the jet is 
pointed towards (or away from) the observer, the real angles 
between the patch, AGN and observer could be smaller. The 
low expected metalicity for such IGM clouds could explain 
the presence of strong Ly-a emission lines and the absence 
of strong metal lines, such as the CIV lines seen near im- 
ages A, B and C' (Schneider et al. 1987; Lawrence 1996). 
If these patches are indeed illuminated by the AGN, we ex- 
pect this Ly-a emission to be highly polarized. Polarimetry 
observations on a 10-m class telescope could confirm this. 

The ultimate test between the different one and two-lens 
models will be the measurement of the redshift of spectral 
(emission and absorption) lines from complex C', which can 



only be z «3.273, according to the model proposed in this 
paper. 
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